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Abstract 

We apply a quasi-model-independent strategy ("Sleuth") to search for new high pr physics in w 100 
pb _1 of pp collisions at y/s = 1.8 TeV collected by the D0 experiment during 1992-1996 at the 
Fermilab Tevatron. We systematically analyze many exclusive final states and demonstrate sensitivity 
to a variety of models predicting new phenomena at the electroweak scale. No evidence of new high 
Pt physics is observed. 
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It is generally recognized that the standard model, an 
extremely successful description of the fundamental par- 
ticles and their interactions, must be incomplete. Un- 
fortunately, the possibilities beyond the current para- 
digm are sufficiently broad that the first hint could ap- 
pear in any of many different guises. This suggests the 
importance of performing searches that are as model- 
independent as possible. In this Letter we describe a 
search for new physics beyond the standard model, as- 
suming nothing about the expected characteristics of the 
new processes other than that they will produce an ex- 
cess of events at high transverse momentum (pr). An 
explicit prescription ("Sleuth") Jl],^] is applied to many 
exclusive final states jl|-|| in a data sample correspond- 
ing to approximately 100 pb _1 of pp collisions collected 
by the D0 detector g] during 1992-1996 (Run I) at the 
Fermilab Tevatron. 

The data are partitioned into exclusive final states us- 
ing standard criteria that identify isolated and energetic 
electrons (e), muons (/i), and photons (7), as well as jets 
(j), missing transverse energy {Ifir), and the presence of 
W and Z bosons p}. For each exclusive final state, we 
consider a small set of variables given in Table H. The 
notation Pr * s shorthand for p 3 ^ if the final state con- 
tains only one jet, and Y17=2Pt if the final state contains 
n > 2 jets, unless the final state contains only n > 3 jets 
and no other objects, in which case 53<=3 Pr * s use d- Lep- 
tons and fx from reconstructed W or Z bosons are not 
considered separately in the left-hand column. Because 
the muon momentum resolution in Run I was modest, 
we define X] = S Pt f° r events with one or more elec- 
trons and one or more muons, and we determine I^t from 
the transverse energy summed in the calorimeter, which 
includes the pt of electrons, but only a negligible fraction 
of the pr of muons. When there are exactly two objects 
in an event (e.g., one Z boson and one jet), their p T val- 
ues are expected to be nearly equal, and we therefore use 
the average px of the two objects. When there is only 
one object in an event (e.g., a single W boson), we use no 
variables, and simply count the number of such events. 



If the final state includes 


then consider the variable 






one or more charged leptons 




one or more electroweak bosons 


EPt /W/Z 


one or more jets 


EVt 



TABLE I. A quasi-model-independently motivated list of 
interesting variables for any final state. The set of variables to 
consider for any exclusive channel is the union of the variables 
in the second column for each row that pertains to that final 
state. 



The Sleuth algorithm requires as input a data sam- 
ple, a set of events modeling each background process i, 
and the number of background events bi ± Sbi from each 
background process expected in the data sample. From 
these we determine the region 1Z of greatest excess and 



quantify the degree V to which that excess is interest- 
ing. The algorithm itself, applied to each individual final 
state, consists of seven steps: 

(1) We construct a mapping from the c£-dimensional vari- 
able space defined by Table [| into the (i-dimensional unit 
box (i.e., [0, l} d ) that flattens the total background dis- 
tribution. We use this to map the data into the unit 
box. 

(2) We define a "region" R about a set of N data points 
to be the volume within the unit box closer to one of 
the data points in the set than to any of the other data 
points in the sample. The arrangement of data points 
themselves thus determines the regions. A region con- 
taining N data points is called an iV-region. 

(3) Each region contains an expected number of back- 
ground events bji, numerically equal to the volume of the 
region x the total number of background events expect- 
ed, and an associated systematic error Sb^, which varies 
within the unit box according to the systematic errors as- 
signed to each contribution to the background estimate. 
We can therefore compute the probability p N that the 
background in the region fluctuates up to or beyond the 
observed number of events. This probability is the first 
measure of the degree of interest of a particular region. 

(4) The rigorous definition of regions reduces the number 
of candidate regions from infinity to ~ 2 NdtLta . Impos- 
ing explicit criteria on the regions that the algorithm is 
allowed to consider further reduces the number of can- 
didate regions. We apply geometric criteria that favor 
high values in at least one dimension of the unit box, 
and we limit the number of events in a region to fifty. 
The number of remaining candidate regions is still suf- 
ficiently large that an exhaustive search is impractical, 
and a heuristic is employed to search for regions of ex- 
cess. In the course of this search, the iV-region TZn for 
which p N is minimum is determined for each N, and 
Pn = minfl (p N ) is noted. 

(5) In any reasonably-sized data set, there will always be 
regions in which the probability for bn to fluctuate up 
to or above the observed number of events is small. We 
determine the fraction Pn of hypothetical similar experi- 
ments (hse's) in which pjv found for the hse is smaller than 
Pn observed in the data by generating random events 
drawn from the background distribution and computing 
Pat by following steps (l)-(4). 

(6) We define P and iV m i n by P — PN min — mui/v {Pn), 
and identify 1Z = 7£jv min a s the most interesting region in 
this final state. 

(7) We use a second ensemble of hse's to determine the 
fraction V of hse's in which P found in the hse is smaller 
than P observed in the data. The most important out- 
put of the algorithm is this single number P 1 which may 
loosely be said to be the "fraction of hypothetical similar 
experiments in which you would see an excess as inter- 
esting as what you actually saw in the data." V takes on 
values between zero and unity, with values close to zero 
indicating a possible hint of new physics. The computa- 
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tion of V rigorously takes into account the many regions 
that have been considered within this final state. 

The smallest V found in the many different final states 
considered (V m m) determines V, the "fraction of hy- 
pothetical similar experimental runs (hser's) that would 
have produced an excess as interesting as actually ob- 
served in the data," where an hser consists of one hse 
for each final state considered. V is calculated by simu- 
lating an ensemble of hypothetical similar experimental 
runs, and noting the fraction of these hser's in which the 
smallest V found is smaller than the smallest V observed 
in the data. Like V, V takes on values between zero and 
unity, and the potential presence of new high pt physics 
would be indicated by finding V to be small. The differ- 
ence between V and V is that in computing V we account 
for the many final states that have been considered. The 
correspondence between "Pmin and V for the final states 
considered here is shown in Fig. |l](a). 

D0 has previously analyzed several final states ( 2j, 
ee, e$T, W'y, W, Z, Zj, and Wj) t| in a manner simi- 
lar to the strategy used here, but without the benefit of 
Sleuth. No evidence of physics beyond the standard mod- 
el was observed. The final states we describe in this Let- 
ter divide naturally into four sets: those containing one 
electron and one muon (e/iJ); those containing a single 
lcpton, missing transverse energy, and two or more jets 
(VF+jets-like); those containing two same-flavor leptons 
and two or more jets (Z+jets-like); and those in which 
the sum of the number of electrons, muons, and photons 
is >3 [3(e/ M / 7 )X]. 

The efiX data correspond to 108±6 pb 1 of integrat- 
ed luminosity. The data and basic selection criteria are 
identical to those used in the published tt cross section 
analysis for the dilepton channels ||], which include the 
selection of events containing one or more isolated elec- 
trons with pj, > 15 GeV, and one or more isolated muons 
with pj, > 15 GeV. In this Letter all electrons (and pho- 
tons) have |?7dct|< 1-1 or 1.5 <|??dct|< 2.5, and muons 
have | ?7det |< 1-7, unless otherwise indicated 0). The 
dominant backgrounds to the efiX final states are from 
Z/j* — ► tt — ► eiivvvv, and processes that generate a 
true muon and a jet that is misidcntificd as an electron. 
Smaller backgrounds include WW and tt production. 

The W+jetsTike final states include events in both the 
electron and muon channels. The elpT2j{nj) events ||, 
corresponding to 115 ± 6 pb -1 of collider data, have one 
electron with p e T > 20 GeV, $t > 30 GeV, and two or 
more jets with pP T > 20 GeV and | ?7det | < 2-5. The elec- 
tron and missing transverse energy are combined into a 
W boson if 30 < Mf < 110 GeV. The ^$ T 2j(nj) da- 
ta H correspond to 94 ±5 pb -1 of integrated luminosity. 
Events in the final sample must contain one muon with 
p 1 ^ > 25 GeV and |??det|< 0.95, two or more jets with 
p? T > 15 GeV and |r?det|< 2.0 and with the most energet- 
ic jet within |r/dct | < 1-5, and $t > 30 GeV. Because an 
energetic muon's momentum is not well measured in the 
detector, we are unable to separate "W-like" events from 



"non-VT-like" events using the transverse mass, as done 
above in the electron channel. The muon and missing 
transverse energy are therefore always combined into a 
W boson. The W{— ► h$t) 2j( n j) final states are com- 
bined with the W{— * eftx) %j( n j) final states described 
above to form the W2j(nj) final states. The dominant 
background to both the efx 2j(nj) and \iT$t 2j( n j) final 
states is from W + jets production. A few events from 
tt production and scmilcptonic decay are expected in the 
final states W 3j and W4j. 

The Z+jets-like final states also include events in both 
the electron and muon channels. The ee2j(nj) data |1C| ] 
correspond to an integrated luminosity of 123 ± 7 pb . 
Offline event selection requires two electrons with trans- 
verse momenta p^ > 20 GeV and two or more jets with 
pP T > 20 GeV and | f^ct |< 2.5. We use a likelihood 
method to help identify events with significant missing 
transverse energy ||. An electron pair is combined in- 
to a Z boson if 82 < M ee < 100 GeV, unless the event 
contains significant fx or a third charged lepton. The 
/j,li2j(nj) data O] correspond to 94±5 pb -1 of integrat- 
ed luminosity. Events in the final sample contain two or 
more muons with pj, > 20 GeV and at least one muon 
with |?/dct|< 10, and two or more jets with p J T > 20 GeV 
and |?7det|< 2.5. A /i/i pair is combined into a Z boson 
if the muon momenta can be varied within their resolu- 
tions such that m w ~ Mz and $t ~ 0. The dominant 
background to both the ee2j(nj) and fifi2j(nj) data is 
from Drell-Yan production, with Z/j* — ► (ee/ufi). 

Events in the 3(e/ fi/j)X final states are analyzed us- 
ing 123 ± 7 pb -1 of integrated luminosity. All objects 
(electrons, photons, muons, and jets) are required to be 
isolated, to have pr > 15 GeV, and to be within the 
fiducial volume of the detector. Jets are required to have 
\rj\< 2.5. IfjT is identified if its magnitude is larger than 
15 GeV. The dominant backgrounds to many of these 
final states include Z^ and W Z production. 

Refs. [jl],[| provide examples of Sleuth's performance 
on representative signatures. When ignorance of both 
WW and tt is feigned in the e\iX final states, we find 
'Pe/j.^T = 2-4cr and V elJ ,m T 2j = 2.3a in D0 data, correctly 
indicating the presence of WW and tt. When ignorance 
of tt only is feigned, we find V e ^ T 2j = l-9o\ Excess- 
es are observed with only 3.9 WW events expected in 
e/jL^r (with a background of 45.6 events), and only 1.8 tt 
events in e/ifr 2j (with a background of 3.4 events), even 
though Sleuth "knows" nothing about either WW or tt. 
We are able to consistently find indications of the pres- 
ence of WW and tt in an ensemble of mock experiments 
at a similar level of sensitivity. 

In the VF+jets-like final states we again feign ignorance 
of tt in the background estimate, and find 'Pmin > 3<r in 
30% of an ensemble of mock experimental runs on the 
final states W3j, W4j, W5j, and W6j. In the Z+jets- 
like final states we consider a hypothetical signal: a first 
generation scalar leptoquark with a mass of 170 GeV and 
a branching ratio into charged leptons of (3 = 1. In the 
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ee 2j final state 5.9 ±0.8 such leptoquark events would be 
expected with a background of 32 ±4 events. Sleuth finds 
T~'ee2j > 3.5ct in 80% of the mock experiments performed. 
Finally, in the final states 3(e/ fj,/j)X we find that a care- 
ful and systematic definition of final states can result in 
discovery sensitivity with only a few events, independent 
of their kinematics. We conclude from these studies that 
Sleuth is sensitive to a variety of new physics signatures. 

Figure || shows the results of the Sleuth analysis of 
two typical final states (W2j and Z2j). The variable 
space defined by Table | is two-dimensional; parentheses 
are used in the axis labels to indicate the transformed 
variables of the unit box. The circles are individual da- 
ta events, and filled circles define the region selected by 
Sleuth. The regions chosen are seen to correspond to high 
Pt in at least one dimension, as required by the imposed 
criteria. Visually, these regions do not appear to con- 
tain an unusual excess, and large Vs are found. Similar 
results are obtained for other final states. 

Table [n] summarizes the values of V obtained for all 
populated final states analyzed in this article. Taking in- 
to account the many final states (both populated and un- 
populated) that are considered, we find V—0.89, implying 
that 89% of an ensemble of hypothetical similar experi- 
mental runs would have produced a final state with a can- 
didate signal more interesting than the most interesting 
observed in these data. Figure |l|(b) shows a histogram of 
the V values, in units of standard deviations, computed 
for the populated final states analyzed in this article, to- 
gether with the distribution expected from a simulation 
of many mock experimental runs. Good agreement is ob- 
served. We find no evidence of new high px physics in 
these data. 
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48.5±7.6 


39 


0.14 


(+1.08(7) 




13.2±1.5 


13 


0.45 


(+0.13(7) 


e.\i$T 2j 


5.2±0.8 


5 


0.31 


(±0.50cr) 


e^,]pT 3j 


1.3±0.3 


1 


0.71 


(-0.55(7) 




VF+jets-like 








W2j 


400 ± 53 


441 


0.29 


(±0.55cr) 


W3j 


77 ± 10 


67 


0.23 


(±0.74<t) 


WAj 


14.3 ±2.3 


15 


0.53 


(-0.08(7) 


W5j 


1.8 ± 0.4 


1 


0.81 


(— 0.88ct) 


W6j 


0.25 ±0.07 


1 


0.22 


(+0.77(t) 


e$ T 2j 


11.6 ± 1.7 


7 


0.76 


(-0.71(7) 


e]pr 3j 


2.5 ±0.6 


5 


0.17 


(+0.95(7) 


e$ T 4j 


0.80 ±0.24 


2 


0.13 


(+1.13(7) 




Z+jets-like 








Z2j 


98 ± 19 


85 


0.52 


(-0.05a) 


Z3j 


13.2 ±2.7 


12 


0.71 


(-0.55(7) 


Z4j 


1.9 ±0.5 


1 


0.83 


(-0.95(7) 


ee2j 


32 ±4 


32 


0.72 


(-0.58(7) 


ee3j 


4.5 ±0.6 


4 


0.61 


(-0.28a) 


GO -tj 


0.64 ± 0.20 


3 


0.04 


(+1.75ct) 


ee$ T 2j 


3.7 ±0.8 


2 


0.68 


(-0.47a) 




0.45 ±0.13 


1 


0.36 


(+0.36ct) 


eelpT 4j 


0.061 ±0.028 


1 


0.06 


(+1.55(7) 


fifi2j 


0.50 ±0.15 


2 


0.08 


( + 1.41(7) 












eee 


2.6 ± 1.0 


1 


0.89 


(-1.23cr) 


Zl 


4.3 ±0.7 


3 


0.84 


(-0.99a) 


ZlJ 


1.03 ±0.31 


1 


0.63 


(-0.33cr) 


ee"/ 


2.2 ±0.4 


1 


0.88 


(-1.17(7) 


ee^jlpT 


0.26 ±0.10 


1 


0.23 


(+0.74a) 


e77 


10.7 ± 2.1 


6 


0.66 


(-0.41cr) 




2.3 ±0.7 


4 


0.21 


(+0.81cr) 




0.37 ±0.15 


1 


0.30 


(+0.52(7) 




0.21 ±0.08 


1 


0.18 


(+0.92(7) 


777 


2.5 ±0.5 


2 


0.41 


(+0.23a) 


V 






0.89 


(-1.23a) 



(b) 



-2 
P(O) 



TABLE II. Summary of results. The most interesting fi- 
nal state is found to be eeij, with V — 0.04. Upon taking 
into account the many final states we have considered in this 
analysis, we find V = 0.89. 



FIG. 1. (a) The correspondence between V and 'Pmin, each 
expressed in units of standard deviations. The curve reflects 
the number of final states, both populated and unpopulated, 
considered in this Letter, (b) Histogram of the V values com- 
puted for the populated final states considered in this article, 
in units of standard deviations. The distribution agrees well 
with expectation. 
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FIG. 2. Examples of Sleuth's analysis of the final states (a) 
W2j and (b) Z2j. 
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